FADI: a fault tolerant environment for open distributed computing
نویسندگان
چکیده
FADI is a complete programming environment that serves the reliable execution of distributed application programs. FADI encompasses all aspects of modern fault-tolerant distributed computing. The built-in usertransparent error detection mechanism covers processor node crashes and hardware transient failures. The mechanism also integrates user-assisted error checks into the system failure model. The nucleus non-blocking checkpointing mechanism combined with a novel selective message logging technique delivers an efficient, low-overhead backup and recovery mechanism for distributed processes. FADI also provides means for remote automatic process allocation on the distributed system nodes.
منابع مشابه
Improving the palbimm scheduling algorithm for fault tolerance in cloud computing
Cloud computing is the latest technology that involves distributed computation over the Internet. It meets the needs of users through sharing resources and using virtual technology. The workflow user applications refer to a set of tasks to be processed within the cloud environment. Scheduling algorithms have a lot to do with the efficiency of cloud computing environments through selection of su...
متن کاملA novel design with Cellular Automata for System-Under-Test in Distributed Computing
Fault tolerant computing system is a vital criterion for reliable computation in distributed computing environment. This prerequisite has initiated system-under-test (SUT) as a testing approach to investigate the possible failure in any components of distributed computing. Validation of fault tolerant procedure is often performed by injecting faults. Fault injection causes for the actual faults...
متن کاملFault Tolerant Parallel Image Generation on a Workstation Network
Image generation for computer movies is a good candidate application for parallelisation. This application was used as a starting point to design a fault tolerant distributed computing environment aimed to run parallel applications. The paper rst describes the context of this work, then it presents the requirements that the environment should meet. The paper then describes the use of the Distri...
متن کاملData Replication Model For Remote Procedure Call Transactions
Remote Procedure Call (RPC) is the most popular model to facilitate the building of distributed programs. However, it only provides a restricted availability of update operations and does not support fault-tolerant. The combination of RPC and data replication techniques has been developed to support fault-tolerant from Remote Procedure Call (RPC). However, different data replication techniques ...
متن کاملFault Tolerant DNA Computing Based on Digital Microfluidic Biochips
Historically, DNA molecules have been known as the building blocks of life, later on in 1994, Leonard Adelman introduced a technique to utilize DNA molecules for a new kind of computation. According to the massive parallelism, huge storage capacity and the ability of using the DNA molecules inside the living tissue, this type of computation is applied in many application areas such as me...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEE Proceedings - Software
دوره 147 شماره
صفحات -
تاریخ انتشار 2000